Adaptation Strategies for Automated Machine Learning on Evolving Data

نویسندگان

چکیده

Automated Machine Learning (AutoML) systems have been shown to efficiently build good models for new datasets. However, it is often not clear how well they can adapt when the data evolves over time. The main goal of this study understand effect stream challenges such as concept drift on performance AutoML methods, and which adaptation strategies be employed make them more robust. To that end, we propose 6 evaluate their effectiveness different approaches. We do a variety approaches building machine learning pipelines, including those leverage Bayesian optimization, genetic programming, random search with automated stacking. These are evaluated empirically real-world synthetic streams types drift. Based analysis, ways develop sophisticated robust techniques.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Machine Learning Models for Housing Prices Forecasting using Registration Data

This article has been compiled to identify the best model of housing price forecasting using machine learning methods with maximum accuracy and minimum error. Five important machine learning algorithms are used to predict housing prices, including Nearest Neighbor Regression Algorithm (KNNR), Support Vector Regression Algorithm (SVR), Random Forest Regression Algorithm (RFR), Extreme Gradient B...

متن کامل

Automated Machine Learning on Big Data using Stochastic Algorithm Tuning

We introduce a means of automating machine learning (ML) for big data tasks, by performing scalable stochastic Bayesian optimisation of ML algorithm parameters and hyper-parameters. More often than not, the critical tuning of ML algorithm parameters has relied on domain expertise from experts, along with laborious handtuning, brute search or lengthy sampling runs. Against this background, Bayes...

متن کامل

Strategies and Principles of Distributed Machine Learning on Big Data

The rise of Big Data has led to new demands for Machine Learning (ML) systems to learn complex models with millions to billions of parameters, that promise adequate capacity to digest massive datasets and offer powerful predictive analytics (such as high-dimensional latent features, intermediate representations, and decision functions) thereupon. In order to run ML algorithms at such scales, on...

متن کامل

Single-molecule Biophysics: Machine Learning for Automated Data Processing

Single-molecule fluorescence microscopy has been greatly successful in understanding biophysics at molecular level. This technique has been advancing toward higher throughput, which creates a need for a data-analysis tool to distinguish molecule of interest from other fluorescence signals. Here, we have used supervised machine-learning approaches to filter biological events of our interest, and...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Pattern Analysis and Machine Intelligence

سال: 2021

ISSN: ['1939-3539', '2160-9292', '0162-8828']

DOI: https://doi.org/10.1109/tpami.2021.3062900